Systolic architectures for connected speech recognition
نویسندگان
چکیده
منابع مشابه
Hybrid architectures for speech recognition
The state-of-the-art automatic speech recognition (ASR) systems utilize a statistical pattern recognition framework called HMM/GMM (Hidden Markov Model / Gaussian Mixture Model) with short time spectral features such as Mel Frequency Cesptral Coefficients (MFCC) or Perceptual Linear Prediction (PLP). Although this approach has been shown to be effective in capturing speech patterns, recent perf...
متن کاملA Systolic FPGA Architecture of Two-Level Dynamic Programming for Connected Speech Recognition
In this paper, we present an efficient architecture for connected word recognition that can be implemented with field programmable gate array (FPGA). The architecture consists of newly derived two-level dynamic programming (TLDP) that use only bit addition and shift operations. The advantages of this architecture are the spatial efficiency to accommodate more words with limited space and the ab...
متن کاملMandarin connected digits recognition for whispered speech
In this paper, the acoustic characteristics and recognition of whispered speech are discussed. A Mandarin digits database is built both in normal speech and whispered speech. The collected speech materials of normal and whispered speech are analyzed to verify the characteristics and differences for the two kinds of speech. Cross recognition is carried out using normal and whispered speech as tr...
متن کاملHybrid SVM/HMM architectures for speech recognition
In this paper, we describe the use of a powerful machine learning scheme, Support Vector Machines (SVM), within the framework of hidden Markov model (HMM) based speech recognition. The hybrid SVM/HMM system has been developed based on our public domain toolkit. The hybrid system has been evaluated on the OGI Alphadigits corpus and performs at 11.6% WER, as compared to 12.7% with a triphone mixt...
متن کاملSpeech Recognition Architectures for Multimedia Environments
Computer workstations have recently become powerful enough to support speech recognition entirely in software, but speech recognizers still vary in their functionality, and each vendor offers their own programmatic interface. Developing recognition applications currently means writing to non-portable protocols. As new improved recognizers become available, such applications will need to be rewr...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: IEEE Transactions on Acoustics, Speech, and Signal Processing
سال: 1986
ISSN: 0096-3518
DOI: 10.1109/tassp.1986.1164918